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ABSTRACT 



Context. Transit detection algorithms are mathematical tools used for detecting planets in the photometric data of transit surveys. In 
this work we study their application to space-based surveys. 

Aims. Space missions are exploring the parameter space of the transit surveys where classical algorithms do not perform optimally, 
either because of the challenging signal-to-noise ratio of the signal or its non-periodic characteristics. We have developed an algorithm 
addressing these challenges for the mission CoRoT. Here we extend the application to the data from the space mission Kepler. We 
aim at understanding the performances of algorithms in different data sets. 

Methods. We built a simple analytical model of the transit signal and developed a strategy for the search that improves the detection 
performance for transiting planets. We analyzed Kepler data with a set of stellar activity filtering and transit detection tools from the 
CoRoT community that are designed for the search of transiting planets. 

Results. We present a new algorithm and its performances compared to one of the most widely used techniques in the literature using 
CoRoT data. Additionally, we analyzed Kepler data corresponding to quarter Ql and compare our results with the most recent list of 
planetary candidates from the Kepler survey. We found candidates that went unnoticed by the Kepler team when analyzing longer data 
sets. We study the impact of instrumental features on the production of false alarms and false positives. These results show that the 
analysis of space mission data advocates the use of complementary detrending and transit detection tools also for future space-based 
transit surveys such as PLATO. 

Key words. Methods: data analysis; Techniques: photometric; Stars: planetary systems; Stars: binaries: eclipsing 



1. Introduction 

Transit detection algorithms are mathematical tools that aim at 
detecting the signal of a transiting planet in the photometric time 
series of a star (light curve). The generally accepted characteris- 
tics of the planet transit are a short, small, periodic decrease of 
the (assumed) constant luminosity flux of the star caused by the 
interposition of the opaque planet in front of the stellar disk, as 
seen from the point of view of the observer. Transits are short 
because the duration of a transit is roughly proportional to the 
quotient of the radius of the star over th e semi-major axis of the 
orbit of the planet (tr/P ~ R,/na; see lSeager & Mallen-Ornelasl 
I2003h . This means that even for the shortest known period 
ex oplanets orbiting at 3 or 4 ste llar radii, such as 55 Cnc 
(IWinn et all \20Uh, WA SP- 19b dHebb et alj l20Toh. WASP- 



43b dHeUier etal J 1201 lh. Kepler- 10b dBatalha et al.l 120111) . 
CoRoT-7b dLeger Rouan, Schneider et al. 20091) . or WASP- 1 8b 
dHellier et all2 009) (all of them with orbital periods shorter than 
24h) the duration is shorter than one tenth of the orbital period. 
Transit depths are small because the diminution of the flux is 
proportional to the square of the quotient of the radii of the planet 

and the star: AF/F ~ (R p /R,f. 

The arr ival of the space- based surveys of exoplanets such 
as CoRoT dBaglin et al.ll2006h and Kepler dBorucki et alJ feOlO) 
has placed transit surveys in the region of the parameter space 
where transits are not short, in terms of hours, any more. For 



example, the transit of CoRoT-9b dDeeg et al.ll2010l) lasts 8h, 
but there are also detections of giant eclipsing binaries where 
the eclipses last several days, like CoRoT 101126445, whos e 
eclipses last 49h for a period of 30 days dCabrera et al J 12009). 
At the same time, the detected signals are not weak either. In 
this case there are two tendencies, on the one hand toward re- 
ally shallow transits, such as those of the terrestrial planets 55 
Cnc e, CoRoT-7b or Kepler- 10b, and on the other hand toward 
transits of small planets around small stars, which enables de- 
tecting Earth-sized planet s from the ground, such as GJ-1214b 
dCharbonneau et alJl2009h . Finally there are detections of tran- 
sits that ar e not periodic anymo re, the transit timing variations of 
Kepler-9c (H olman et alj|2010l) reach an amplitude of 140 min- 
utes, comparable with the length of the transit, in a time span of 
200 days. 

Space surveys of transiting planets such as CoRoT and 
Kepler have improved our knowledge about the performances of 
transit detection techniques. Between 2004 and 2007 the CoRoT 
community made a series of studies to test the transit detec- 
tion capabilities of different alg orithms. The analysis of th e re- 
sults of these tests, published bv lMoutou et alJ d2005Ll2007l) . re- 
vealed on the one hand that all algorithms had competitive per- 
formances and yielded comparable results, but on the other hand 
it was shown that both false alarms and non-detections were 
method dependent. The transit detection approach followed by 
the CoRoT community is based on the results of these analyses: 
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combine the results of different teams using different detrending 
and detection techniques to reduce the rate of false alarms and 
to push the detection limit as far as possible. 

The Kepler survey has followed a different approach using 
one single detrendi ng and detection algorithm ( I Jenkins! [2002; 
I Jenkins et al.l l2010b) w hose performance has been extensively 
studied and tested dJenkins et al]|2b 02. 201 ( Jab . In view of future 
transit search missions, e.g. PLATO (Catala 2009), it is interest- 
ing to compare the different approaches. 



are several tools in the literature dealing 
automatic detection of transiting candidates 



There 
with the 

dAigrain & Favatal l2002t iBonomo & Lanza! 120081: iBorde et al 
2007; Carpano & Fridlund 2008; I Collier Cameron etal 
2006 : iDefavet alJ 120011: iGrziwa et al.1 l201 ~ 
19961; Uenkinsl 120021; IKovacs etalT 1200 
2005 



Protopapas et al. 



Regulo et al. 



iJenkins et al 
JOfiri 120081: 



iRenner et al.l 
In addition to 



2008; Schwarzenberg-Czernv & Beaulieu 2006 
those, ISchwarzenberg-Czernvl d!998l) gives a very interesting 
discussion on the general techniques for the detection of 
periodic signals. 



The code presented in this paper has been used regu- 
larly since 2006 in t he CoRoT survey f or transiting planets 
dCarpano et ail 120091: ICabrera et all 120091: lErikson et all 120121; 
ICarone et al.ll2012t ICavarroc et al.ll2012l) . The algorithm, called 
DST, (Detection Specialisee de Transits) addresses some of the 
challenges that face current and future transit surveys. This al- 
gorithm aims at a specialized detection of transits by improving 
the consideration of the transit shape and the presence of transit 
timing variations. As a first step, we have applied our CoRoT 
detrendin g and transit de tection tools to the public data release 
Ql dBorucki et al.ll201 Tab and compared t he results to the list of 
planetary candi dates published by Keple r dBorucki et al.ll20lTbl 
hereafter IB 1 lbL and lBatalha et al.ll2.Q12h . We aim at testing the 
capabilities of a different algorithm to find new transit candi- 
dates in a reduced data set, because we also aim at understand- 
ing the dependence of the yield as a function of the length of 
the observations. Su ch a study was already started in CoRoT 
dCabrera et al.l l2009) and it is of great interest when deciding the 
observation strategy of transit surveys, for example when decid- 
ing the optimimum length of an observational run. On the other 
hand, it is important to find successful algorithms that are able 
to detect planets in data sets with a reduced number of transit 
events. This is the case of modern transit surveys aiming at de- 
tecting planetary candidates with periods comparable to the total 
length of the observations, for example planets in the habitable 
zone of solar-like stars. 



Section [2] contains the mathematical description of the al- 
gorithm. Section [3] compares the performance of t he new algo- 
rithm with a widely used technique, the BLS code dKovacs et alJ 
I2002h . We do not aim to reproduce the algorithm comp arisons 
found i n the literature such as the ana lysis made by iTinglevI 
d2003albh or lMoutouetail d2005l 120071) but to present the dif- 
ferences of this new code with respect to one of the most widely 
used codes (BLS) and explain its advantages. Section|6]describes 
the treatment applied to XepJerQl data. Section|7]describes the 
results of the test, including a number of new transiting candi- 
dates. Finally, Section [8] summarizes the outcome and describes 
future developments. 



2. The transit search algorithm 

A light curve is a time series of the photometric observations of 
a star. Mathematically, it can be seen as a collection of pairs of 
time-flux, which can be translated into a function like 



g(x) = Y^giUhix-xi); 

where (x,-, g,) are the N pairs time-flux and 

n ( - x ) = I x ' ' ~ x < Xi + h; 

h '' 1 0, elsewhere, 



(1) 



(2) 



with h being the integration time. g(x) should not be understood 
as the function representing the continuous time waveform cor- 
responding to the photometric measurement, but as a mathemati- 
cal tool to represent photometric observations. If G(t) represents 
the continuous waveform coming from the star, gj is defined as 



l r x>+h 

h L 



G(t)dt. 



(3) 



The BLS dKovacs et al.l l2002) is arguably the most popular 
transit detection algorithm in the literature. It proposes a box 
model for the transit shape, which can be described as 



f(x) = 



L, X/ e T 
H, elsewhere, 



(4) 



where H is the mean value of the flux out of transit and H - L is 
the depth of the transit, which occurs in the interval T. The test 
statistic defined by BLS is the distance between the functions / 
and g, defined as 

D 2 = J cj(f-g) 2 , (5) 
where we have defined the weight function analogously to g 

N-l 

co(x) = 2^ cot Yl h (x - xi), (6) 



keeping the definition of IKovacs et al.l (|2002): w,- = 

<x7 2 [Z^o °"J 2 ] ■ It i s trivial to prove that $5$ is the product 
of the constant h (the i ntegration tim e ) time s the test statistic de- 
fined by equation 1 in IKovacs et al.l (|2002). Expression (0 can 
be minimized analytically with respect to the variables H and L 
once the transit region T is known: 



N-\ 



H-- 



i=0 



ieT 



N-l 
i=0 
ieT 



ieT 



(7) 



Z 

ieT 



and the BLS test statistic is 

(N-l 



N-l 



i=0 ieT ) V ieT 



(8) 
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which is further simplified by making J^f =Q (Oigi = 0, and 
w,- = 1 ; this finally produces equation 4 in IKovacs et al.l 
(200J). 

In practice, as the ephemeris of the transit (period P, epoch 
xq and transit length d) are not known a priori, one folds and 
bins the light curve to a test period and then calculates the test 
statistic for every possible value of xq and d. The minimum of the 
test statistic corresponds to the ephemeris of the transit. If there 
is no significant extreme value of the test statistic, the conclusion 
is that there is no detectable transit in the light curve at the tested 
period. 

We can define an alternative model to the transit function us- 
ing a parabola instead of a box function and modifying the def- 
inition of the region in transit. The advantages of these choices 
are shown in Section [3] The model function will be 



/(*) = 



46k 2 
H - 6 k + — r (x - Xok) , xeT k ; 



H, 



4 



(9) 



x ( T k . 



We assume that there are K observed transits, each transit de- 
noted as T k centered in the position x a k- The definition of the 
transits is discussed in the next section. The duration of the Mi 
transit 7* is given by dk and its depth by 8k- The definition of the 
distance between the functions / and g is identical to (O and its 
minimization with respect to the variables H and 5 k yields the 
following constrain: 



N- 1 



H- 



1=0 k 



N-i 



i=0 



(10) 



HAl 



A 



where 



sK 



ieT t 

fez* 



d\ 

4 + 12 



Xi X Q k 



Xi Xok ' 



12 



\(h 2 -d 2 )[x r 



(11) 



l 

H — 



10 



d\h 2 d\ ^ 

~ + T 



Using the above constrains, the distance between the functions 
/ and g becomes 



D 2 =h 



N- 1 



N-l 



(12) 



However, contrary to IKovacs et al.l d2002l) . we are not go- 
ing to define the test statistic as the distance between the func- 
tions / and g (or, in the case of BLS, the signal residue re- 
sulting fro m subtracting the consta n t term 2£o' fr° m ©■ 
Following Schwarzenberg-Czernyj (1 19981) . for a signal x that 
is decomposed into its model X\\ and residuals x ± = x — X\\, 
the statistic used to assess if the model is an appropriate de- 
scription of the data will be ||x|||| 2 /||ji[;jj| 2 , where |br|| 2 = YuX 2 . 



As indicated in the previously cited paper, we can draw the 
same conclusions with ot her statistics provided that they are 
uniquely related (see also lJenkins etal.ll 1996t lTinglevl l20Q3b; 
Schwarzenber g-Czernv & Beaulieul l2006l) . Inour notation, x = 
g and jf || = /, whereas T) — \\f — g\\ — \\x - x\\\\ = \\x ± \\; therefore 
x±\\ 2 = \\f\\ 2 /D 2 . In this case 



the statistic is Hxy 

(n-i . s 
\\f\\ 2 = h ^ igi + J]-£g^k 

V i=0 k 

and finally the test statistic is 

45* 



(13) 



mi8i + L -jz gA > 



i=0 



D 2 



N-i 



N-i 



(14) 



i=0 



i=0 



3. Discussion of the transit search algorithm 

The test statistic defined by (TBI is more complicated than de- 
fined by BLS (see [8) and hence it involves a heavier computa- 
tional effort. However, it has four main advantages: 1) it provides 
a better description of the transit shape, which in turn provides 
a better behavior of the algorithm in the presence of transits; 2) 
it uses the same number of free parameters as BLS; it is always 
possible to improve the model of the transit shape by using more 
free parameters (for example, a trapezoid has one more free pa- 
rameter: the duration of the ingress phase), but at the expense of 
computational effort; 3) our estimator performs better than BLS 
for the reasons explained below, and 4) the definition of the re- 
gion in transit is more flexible. 

To illustrate these assertions, we have analyzed public data 
from the satellite CoRolQ with both BLS and the new method 
described in this paper (DST) and compared their performance 
in two test cases. In the first one (Subsections 13.11 and 13.21 1. 
we show how the DST model of the transit shape improves 
the transit detection efficiency. We have chosen two typical 
targets of transit surveys for this test, a planet, CoRoT-lb 
(Barge et al. 2008), and an eclipsing binary, CoRoT 102763847 
(ICarpano et al.ll2009l) . of similar depths and durations, although 
different periods. We chose an eclipsing binary because although 
transit surveys aim at detecting planets, they find many eclips- 
ing binaries as a by-product. The probability to detect eclipsing 
binaries is relatively higher because they are numerous and be- 
cause the S/N ratio of eclipses is in general stronger than the 
S/N ratio of a planetary transit, since in stellar binary systems 
both members have in general comparable sizes. By choosing 
a planet and an eclipsing binary of similar depths and bright- 
ness we can compare the S/N ratio of the detection in typical 
targets of planet detection surveys. Both targets were observed 
at the CoRoT sampling rateflof 32s and 512s and the S/N ratio 
achieved are representative of typical giant planet signals in tran- 
sit surveys. This first test case consists of the analysis of a sec- 
tion of the light curve containing only one transit, respectively 



1 CoRoT public data is available thorough the CoRoT Data Archive 
hosted by IAS: http://idoc-corot.ias.u-psud.fr . 

2 CoRoT-lb was obse rved at 512s and 32s sampling rate during the 
run IRaOl, described in Carpan cTet alJ d2009t> . where it got the win_id 
IRa01_E2_1126. The eclipsing binary with CoRoT id 102763847 was 
observed at 32s sampling rate during the run IRaOl, where it got the 
win_id IRa01-El_l 158, and at 512s sampling rate during the run LRaOl 
dCarone et alj2012h . where it got the win.id LRa01.E2_0588. 
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one eclipse. We implemented the BLS and DST algorithms^ and 
we applied them on the same data set to compare their perfor- 
mances. 

But in addition to Jupiter planets, transit surveys also aim 
at detecting small terrestrial planets. In our second test case 
(Subsection 13.31 ) we compare the detectability with BLS and 
DST of CoRoT-7b, representative of the dim signals of Earth- 
sized planets. In this case we performed a blind search for the 
period of the transiting planet using the same filtered light curve 
for both algorithms. 

3. 1 . Test case I: the transit shape 

A second-order polynomial is expected to yield a better descrip- 
tion of the transit shape than a box model because it includes a 
description, although incomplete, of the ingress and egress of 
the transit, which the box-shaped description simply ignores. 
The number of free parameters in DST and BLS is the same: 
the value of the flux out of transit H, the depth of the transit 
6k — H - L, and the ephemeris of the transit (period P, epoch xq, 
and duration d). This is in contrast to the trapezoidal model of 
the transit, which involves one more free parameter (the duration 
of the ingress phase). As we are analyzing the data of a single 
event, we fix the value of the period in the following analysis . 

We used the BLS and the DST algorithms to fit the afore- 
mentioned free parameters around the position of the chosen 
events. We show in Fig. Q] the residuals of the fit of the CoRoT- 
lb transit sampled at 512s with BLS and DST. Figure [2] shows 
the same residuals for the 32s sampling. These figures show that 
BLS fails to reproduce the transit shape in the ingress and egress, 
as expected. In Section|2]we have shown that the statistic used by 
BLS and DST is the distance between the observed data points 
and the model function as described by expression [5] Table [1] 
shows that DST is indeed a better description of the transit and 
eclipse signal. When the S/N ratio is high (in the 512s sampled 
ratio), DST produces 68% and 59% less residuals than BLS (for 
the planet and the eclipsing binary, respectively). When the S/N 
ratio is lower (in the 32s sampled ratio) the difference is less ac- 
centuated and DST performs around 14% better in both targets. 

It might be rightfully argued that the parabolic is not the 
best description of the^af bottom feature of planetary transits. 
Indeed, there are functions that better reproduce the shape of 
long transits, where the contribution of the flat part is most rele- 
vant, such as the cases of the already discussed CoRoT-9 b or, fo r 
an example of a super-Earth, Kepler-22b dBorucki et alj|2012h . 
Figure [3] shows the binned light curve of Kepler-22b modeled 
with the parabolic shape described in this paper (labeled x 2 ), 
with a quadratic shape (labeled x 4 ) described by equation[T5lbe- 
low, and with BLS. Indeed, Kepler-22b, a ~ 2 Earth radii planet 
with an orbital period of 290 days is not a challenging detection 
for DST considering the distinct signal produced by the planet 
in the periodogram (see Fig.|4|. 

{ 16(5* 4 

/*w=r~* + T ,xer * ; (i5) 

[H, ' xi T k . 

If we compare the residuals of these models, we immediately 
see that most of the residuals of BLS come from the ingress and 
egress phases, as was the case with CoRoT-lb above. We can 
also verify that the x 4 model performs slightly better than the x 2 
model, but we do not obtain the same degree of improvement 

3 the code is available on request to the author. 
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Fig. 1. Residuals of the BLS and DST model of CoRoT- lb sam- 
pled at 512s. The central time of the transit chosen has a Julian 
Date value of 2454 142.8547 days. The dotted vertical lines in- 
dicate the duration of the transit. 

residuals of the fit to the transit of CoRoT-1b (32s sampling) 
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Fig. 2. Residuals of the BLS and DST model of CoRoT-lb sam- 
pled at 32s. The central time of the transit chosen has a Julian 
Date value of 2454 191.1413 days. The dotted vertical lines in- 
dicate the duration of the transit. 



Table 1. Relative residuals of the BLS and DST models of the 
CoRoT-lb and CoRoT id 102763847 targets in parts per million. 



target 


j'(f-gBLs) 2 / j'f 


J (f - gOST? 1 J f 


CoRoT-lb(512s) 


7.7- 10~ 6 


2.4- 10~ 6 


CoRoT-lb(32s) 


8.710 6 


7.5- 10~ 6 


102763847 (512s) 


2.M0- 6 


0.8- 10~ 6 


102763847 (32s) 


7.2-10- 6 


6.3- 10 6 



as when we compare x 2 and BLS. Indeed, if f(x) is the best 
function describing the shape of a transit, and if this function is 
even (which is best true for circular orbits and spherical stars), 
then the x 2 model is the first non-null term of the Taylor series 
of the optimal function f(x). In some cases, the term in x 4 might 
be more representative, but it will always be a more complicated 
description and it will be far from optimal for transits with higher 
impact parameters. The parabolic model is not the best model, 
but it is the most simple (with fewer free parameters), analytic, 
continuous (in contrast to BLS) description of the transit. 
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Fig. 3. Different models (x 2 , x A and BLS) for the light curve of 
Kepler-22b and comparison of the residuals. 
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Fig. 4. Periodogram of the DST statistic for the light curve of 
Kepler-22b. This analysis includes 682 days of data from quar- 
ters Q0 to Q6 and there are three transit events. 



3.2. Test case I: the test statistic for a giant planet 

It is difficult to compare the result of the test statistics of BLS 
and DST because they are defined in different ways and rep- 
resent different mag nitudes. The signal residue (SR) defined in 
IKovacs et alJ ([2002) is derived from expression ||5}, as has been 
shown in Section [2] For example, in our particular analysis of 
the light curve of CoRoT-lb, the value of the SR, defined as 
-\/s 2 /r(l - r) in IKovacs et al.l d2002l) . at the ephemeris of the 
transit has a value of 1.985 18 • 10 6 , whereas it has a value of 
1.985 14 ■ 10 6 outside the transit, which is a difference of about 
22 ppm. However, the statistic used in BLS is the signal detec- 
tion efficiency (SDE), built from the SR, which in turn has an 
on-transit value of 2.5, and this is the value used to determine the 
reliability of a transit candidate. The DST uses expression (IT4l . 
which is not directly comparable to the SR of BLS. However, 
in the case of CoRoT-lb, the value of this estimator on transit 
is 800 times high er than the value off transit (3.5 vs. 0.004). If 
we proceed as in IKovacs et al.l (120021) and define an equivalent 
to the SDE for the DST estimator (see Fig. [5), its value on transit 
is 6.4, more than doubling the performance of BLS. 

One of the disadvantages of the SDE statistic is that 
it includes the points in transit in the calculation of the 
reference standard deviation used to assess the signifi- 



cance of a signal. This is a known fact already discussed 
by different authors (see, for example , Kovacs et al. 2002; 
Schwarzenberg-Czerny & Beaulieu 2006; Jenkins et al. 2010b). 
For example, the WASP survey uses the BLS code for transit de- 
tection, but does not use the SDE as statistic to select planetary 
candidates, but a different s tatistic with dimensions of y 1 ( see 
ICollier Cameron et al .1120061) . 



3.3. Test case II: the test statistic for a terrestrial planet 

The difficulty of comparing the performance of different detec- 
tion algorithms rests in the definition of the test data set and in 
the optimization of the codes. Simulated data sets have the ad- 
vantage of handling the impact of the S/N ratio in the detectabil- 
ity of the signals. How ever, in practice all transit surveys suffer 
from correlated noise dPont et al.l 2006) that dominates the de- 
tectability of dim signals and the rate of false alarms. Therefore 
we chose for our analysis the measured light curve of CoRoT-7b 
as test data set. We used standard techniques for iltering stel- 
lar variability and instrumental residuals (see below). We did not 
control how much the red noise affected the detectability of the 
signal, but we assumed that it would be representative of the typ- 
ical impact of this kind of perturbation in transit surveys. Lastly, 
a fair comparison of different techniques requires that the con- 
figuration parameters used for the transit search algorithms are 
optimized with the same rigor. We have done our best to put 
both algorithms, BLS and DST, in the most similar conditions. 
Considering the results of the previous section, we assume that 
the differences in the performance of the analysis of CoRoT-7b 
that we present are indeed due to the advantages that DST en- 
tails over BLS and not due to an improper configuration of the 
optimizing parameters. 

Figure [5] shows the comparison of the SDE defined in the 
previous subsection for BLS and DST in the analysis of the same 
light curve of CoRoT-7b. First, DST provides a more robust de- 
tection, identifying correctly the right period and many harmon- 
ics up to periods of 20 days, and providing a less noisy result 
than BLS. Second, DST has an SDE of 35 compared to an SDE 
of only 24 for BLS. This test case shows that DST performs bet- 
ter than BLS also detecting the dim signals of terrestrial planets. 
Interestingly, the SDE for CoRoT-7b (35) is bigger than the SDE 
of CoRoT-lb (6.4). There is no reason to be surprised. The SDE 
is, by definition, a poor posed statistic to characterize the relia- 
bility of a signal. The standard deviation of the test statistic is 
included in the denominator of the SDE. Therefore, a very sig- 
nificant peak, such as that created by a transiting hot Jupiter, 
will have a significant contribution to the standard deviation of 
the data, ther efore reducing the s ignificance of the SDE (see the 
discussion in IKovacs et al.l 120021) . If we compare just the value 
of the test statistic of DST, it has a value of 3.5 in the peak and a 
background of 0.004 in the case of CoRoT- lb (a ratio of 800) and 
a value of 0.023 in the peak and a background of 0.0006 in the 
case of CoRoT-7b (a ratio of 38). As expected, the DST statistic 
reflects correctly that CoRoT-lb is a more significant detection 
than CoRoT-7b. Thi s is again a consequence of the definition by 
IKovacs e t al. (2002) of the SDE statistic. 



4. The transit search paradigm 

4.1. The period array 

The most common strategy to search for transit-like signals is 
to perform a blind analysis for an array of test periods. The 
extremes of the period array are defined according to the ex- 
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Fig. 5. Value of the signal detection efficiency SDE of BLS 
and DST for the light curve of CoRoT-7b. Detail of the region 
around the measured orbital period of this planet, indicated with 
a vertical line (0.853 585 + 0.000 024 days). 



pected detectability range: periods shorter than 0.5 days are not 
expected for planets around solar-like stars because of the vicin- 
ity to the host star (as discussed in the introduction). On the 
other hand, the longest detectable period depends on the length 
of the observations and the duty cycle. Finally, one has to de- 
fine the density of periods in the array. Not all a uthors describe 
their c hoices in the literature. Two exceptions are lClarkson et alj 
(I2007I) . who for the SuperWASP-North survey defined a pe- 
riod range between 0.9 and 5 days with a spacing of 0.002 
days for a coar s e sear ch and of 0.001 days for a finer search, 
and iFaedi et all (1201 ll) . who used a method equivalent to ours, 
although in t he frequency range. See also the discussion in 
Uenkins et alJ ( f!^l2010bh . 

The light curve is then folded to each value of the pe- 
riod. For each of these periods, the detection algorithm is run 
for every reasonable combination of transit epoch (from to 
the end of the folded light curve) and duration of the transit 
(which can be estimated, for example, from the expected rela- 
ti ve sizes of planets around solar-li ke stars using the expressions 
in Seager & Mallen-Ornelas 2003). The spacing in the arrays of 
epochs and durations can be adjusted considering the number 
of bins in the fold. This method implicitly assumes that there is 
only one transit per folded light curve. 

We propose a method to estimate the best density of the 
search arrays and an alternative to the blind search described 
above. 

The region in transit T of a folded light curve is defined as 
the points in the interval [xq - d/2,xo + d/2], where xq is the 
epoch of the transit and d its duration. If the light curve is not 
folded, the equivalent definition of the region in transit is the set 
of points that verify the condition: 

x - floorOt/P) ■ P 6 [x - d/2,xo + d/2]. (16) 

If we determine the period P of a transiting planet with an 
error AP and its epoch xq with an error Axo, we can constrain 
the position of the Mh transit xk with an error Ax^ = Axo + 
N • AP. The position of the Mh transit depends on the length of 
the observations I and typically N ~ I/P. Therefore, if we aim 
to constrain the position of the Mh transit with an uncertainty 
smaller than a certain value h (for example, comparable to the 
sampling rate), the acceptable uncertainty in the period has to 



fulfill the condition AP < h* P/I. This condition puts a physical 
constraint on the desired density of the period array. 

4.2. The region of interest 

The region of interest obviously is the transit. The most simple 
strategy is to search one transit-like feature per period, neglect- 
ing the presence of an occultation or secondary transit, which is 
indeed a reasonable assumption for ground-based surveys, and 
neglecting the presence of transiting systems of planets. The 
DST model allows (see definition [9]) to search for k transit- 
like features simultaneously without the need of folding the light 
curve. The definition of the region in transit 7* for the kth transit 
will be equivalent to ( [ToT l. substituting xq by xi, because each 
transit-like feature will have a different epoch. One can also use 
this system to search for secondary transit^ by setting k — 2. 
Including a second transit-like feature in the model is not a sig- 
nificant improvement for planet surveys, because planetary oc- 
cultations are orders of magnitude lower than primary occulta- 
tions (the temperature contrast between the star and the planet 
is big) and therefore the improvement of the test statistic will 
be small. But it can be justified in the case of eclipsing bina- 
ries, where the temperature contrast is comparable and therefore 
primary and secondary eclipse have comparable depths. This 
approach improves the detection and characterization of eclips- 
ing binaries, which are the main source of contamination in the 
search for transiting planets. 

4.3. Transit timing variations 

The shape of a transit is markedly affected when folding the 
light curve of a planet with significant transit timing variation 
(TTV) assumi ng a constant perio d (see, for example, the case 
of Kepler-9c, Hol man et al.l l2010). Although our algorithm per- 
forms reasonably well when detecting planets or planet can- 
didates with significant TTVs, the current configuration has a 
limited performance . See, for example, the case of KOI 1474 
dDawson et al.ll2012l) . a transiting Jupiter-sized candidate with a 
period of ~ 70 days and TTVs of up to lh from transit to tran- 
sit. We analyzed the original Kepler Q0 to Q6 data and calcu- 
lated the periodogram (see upper part of Fig [6]) and the TTVs. 
We used the value of the TTVs to artificially shift each transit 
from the original data to its expected position, if the orbit was 
exactly periodic, and we recalculated the DST statistic (bottom 
part of Fig [6]). It now shows a value at the period of the candi- 
date 25% higher. To overcome this effect, we propose another 
possibility that optimizes the search for planets with significant 
timing variations. If the observations span an interval / and we 
are searching for a planet with a period P, we expect k = I/P 
transits (assuming a full duty cycle). We can define k regions of 
search and minimize for each region with respect to the epoch 
of the Mi transit, which would be treated independently in the 
case of large timing variations. Each region k will cover a region 
of length P since the end of the previous region k — 1 . This in- 
creases the computational effort, but it provides the best descrip- 
tion for this case of scenarios. In the case of Kepler-9c the S/N 
ratio of the detection was high enough to surmount the degra- 
dation of the transit signal. But this might not be the case for 
small planets close to the noise level. For a given pe rturber, the 
timing variations will be stronger for smaller planets (Ago l et al.l 



4 the computational effort can be reduced by assuming circular orbits 
and fixing = xl + P/2, but this assumption is only reasonable for 
short periods. 
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KOI 1474: original periodogram 





1.4 r 




1.2 - 




1.0 - 




0.8 - 




0.6 - 




0.4 - 




0.2 - 




0.0 * 



50 100 150 

period (days) 

KOI 1474: artificial periodogram without TTV 
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ISchwarzenberg-Czernvll 19891 lJenkinsll2002l) . However, in prac- 
tice these limits are not easily applicable. The fundamental as- 
sumption that a given statistic follows a particular probability 
distribution function is that the errors affecting the observations 
can be treated as random Gaussian variables. Unfortunately, as 
we show below, this is not the case because most of the re- 
maining residuals, and for sure the most annoying, are always 
correlated to some degree, invalidating the previous assump- 
tion. Therefore, the threshold preventing the appearance of false 
alarms has to be set accordingly to the level of correlated noise 
remaining in the light curves, which is method dependent, and 
not merely to the expected value according to the theoretical be- 
havior of the statistic. All false alarms found in this paper are 
related to residuals of the stellar activity or instrumental residu- 
als (see below). 



Fig. 6. Comparison of the DST detection statistic for KOI 1474 
in the original data set (top) and once the transit timing variations 
were artificially removed (bottom). 
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Fig. 7. Comparison of the DST detection statistic for the plan- 
ets of the Kepler-36 system. Kepler-36c (above) with a size of 
3.7 Earth radii and a period of 16.2 days produces a clear detec- 
tion signal in the periodogram. Kepler-36b (below) with a size 
of 1.5 Earth radii and a period of 13.8 days, which shows signif- 
icant transit timing variations, produces a distinguishible signal, 
although limited by the non-periodicity of the transit. 



20051: iHolman & Murravl2005l:lBorkovits et al]201 ll) . Recently, 
Carter et all d2012f) mentioned a transit detection algorithm that 
accounts for variations between consecutive transits, similar to 
the idea described above, although the algorithm is not described 
in detail in their paper. We analyzed Kepler public data from 
quarters Q0 and Q9 with DST and are able to detect both planets 
with reasonable S/N ratio, although the detection of Kepler-36b 
is indeed limited by the presence of the TTV (see Fig . EJ . 

4.4. False alarms 

Every transit detection algorithm based on a test statistic to 
determine the presence of a transit must use a threshold to 
distinguish between real and spurious signals. If the proba- 
bility distribution function of the test statistic is known, it is 
straightforward to fix the threshold by allowing any desired 
value of spurious false positives. This procedure is widely dis- 
cussed in the literature in different contexts (see for example 



5. Future developments 

The previous section showed the improvements in the planetary 
detection performance brought by the better description of the 
transit shape and the better test statistic for both giant and ter- 
restrial planets. The only drawback of the DST formalism in 
comparison to BLS is that it involves a slightly heavier compu- 
tational effort. However, this is only a technical question, not a 
formal one, and moreover the increase in computational time is 
not prohibitive. Another advantage of our model is that it keeps 
the same number of free parameters as BLS, not like the trape- 
zoidal model. The drawback of the computational effort is only 
an inconvenience and perhaps can be overcome by changing the 
search strategy. The blind search requires to try, for every trial 
period, every epoch and duration (or every kth epoch and dura- 
tion), which is quite expensive and becomes worse with longer 
exposure intervals, such as those of the Kepler and PLATO sur- 
veys. We can overcome this difficulty by changing the paradigm 
of the search. One can assume that each point of the light curve 
is the center of a transit-like feature of a given length, therefore 
defining as many transits as points in the light curve (k = N). 
Then one can calculate the value of the expressions in (fTTt for 
each point and choose which subset of points of the light curve 
produces a significant change of the test statistic d!4l . If this 
subset of points corresponds to a periodic signal, or even mul- 
tiperiodic in the case of several transiting planets in the system, 
we have a positive detection. However, one cannot yet find such 
algorithms in the literature that can select the subset of points 
that produces a significant change of the test statistic. Searching 
for these particular algorithms is beyond the scope of this pa- 
per, which only aims to present the advantages of the model de- 
scribed in (0, but it will be the subject of future study. 

6. Data treatment 

We analyzed Ql data from the Kepler mission. In particular, 
we use the raw flux column (apjraw-flux) described in the 
Kepler data documentation 0. We did not use any filter for out- 
liers, relying on the Kepler pipeline for the filtering of cosmic 
rays. We analyzed each light curve with a Lomb-Scargle peri- 
odogram dScargldl 19821) to remove harmonic stellar pulsations. 
We then applied the stellar variability filter detailed below be- 
fore using the transit detection algorithm DST to search for the 
planetary candidates. 



In the 



the 



latest released 
flux column 



version 
called 



of Kepler data, 
sap_flux. See 



http : //keplergo . arc . nasa . gov/Documentation . shtml 
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6.1. Stellar variability filter 

The complex variability pattern of solar-like stars has been 
studied in the framework of space-based transit surveys such 
as CoRoT and Kep ler (see, for example Aig rain et al.l [2009; 
iGilliland et al.ll201 ID . The stellar variability filter described here 
is optimized for the removal of the relatively slow changes 
caused by the evolution of stellar spots modulated by the ro- 
tational period of the star. The amplitudes involved in this 
kind of activity are of a few percent in flux with time- 
scales of several days. This pattern is regularly observed by 
space-based telesco pes, where targ ets such as H P 189733 ob- 
serve d by MOST dcroll et alJ 120071) . Co RoT-2 dAlonso et all 
2008) , or HAT-P-11 observe d by Kepler dDeming et all 1201 li 
Sanchis-Oieda & Winn 201 1) are characteristic examples. Other 
activity patterns such as granulation and related phenomena, 
which have typical amplitudes in of 0.01 % to the ppm range and 
frequencies of hundreds of //Hz, are not treated here. 

We assumed that we can separate the incoming flux from the 
target into a stellar signal and a planetary transit signal plus some 
non-correlated residual noise: 

F(t) = F st Jt) + FtonsttCO + W(t). 

This neglects any contribution from instrumental features, which 
in reality will produce correlated residuals. In practice, the data 
were pre-treated to remove any previously known instrumen- 
tal signalQ and any remnant feature will remain in the residu- 
als (which therefore would not be treatable as random Gaussian 
variables, as discussed above). We assumed that the stellar sig- 
nal F stal presents some spot-induced activity signature, which 
has typical time-scales of days and amplitudes of few percent; 
the planetary transit contribution F ttanS i t is zero everywhere ex- 
cept during the transits, where it ha s the characteristic sha pe of 
a transiting planet (see, for example Mandel & Agol 2002). The 
time-scale of a planetary transit, which is of some hours, is dif- 
ferent from the typical time-scales of the spot induced stellar 
variability, therefore we aimed to define an algorithm that can 
separate both. To build a model of the light curve that is sensi- 
tive only to the long-term stellar variability, we binned the data 
adding up to «bin points per bin. This number was chosen for 
the purpose of averaging out as much as possible the signatures 
of transits while the number of points in the model was dense 
enough to resolve the stellar variability. 

Subsequently we used the Savitzky-Golay approach, which 
consists in assigning a polynomial model to the original data 
point s that fits the binned d ata better in a least- squares sense 
(see lSavitzkv & Golavlll964l and |Press et al.ll2002l) . We modeled 
up to « sca ie binned data points with a set of Legendre polynomi- 
als of degree «Le g - The compromise found between the number 
of points in the bins «bi n , the number of points modeled each it- 
eration n sca ie and the degree of the polynomials /i Le g governs the 
reaction of the model to the stellar variability and the presence 
of transits. 

A practical procedure to minimize the effect of transits and 
outliers is to remove the n lem worst points of the fit of the poly- 
nomial and recalculate the coefficients of the polynomial with 
"scale - "rem points. This procedure can be repeated several times 
("times)- Finally, we built F star with the values of the polynomials 
at every time measurement. The values for the different param- 
eters used in this work are summarized in Table [2] and the per- 
formance of the filter is shown in two examples named above: 
CoRoT-2b and HAT-P-1 lb (Figs.|S]and|5). 

6 for example, in the CoRoT context, any periodical variation at the 
orbital period of the satellite. 



Light curve of CoRoT 2b observed by CoRoT 
Savitzky-Golay filter 
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Fig. 8. Model of the stellar activity of the CoRoT light curve 
of CoRoT-2b. The inner box shows the detail of the modeling 
around one of the transits. The lower box shows the residuals of 
the modeling. 



Light curve of HAT-P-1 1 b observed by Kepler 
Savitzky-Golay filter 
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Fig. 9. Model of the stellar activity of the Kepler light curve 
of HAT-P-1 lb. The inner box shows the detail of the modeling 
around one of the transits. The lower box shows the residuals of 
the modeling. 

Table 2. Configuration parameters of the Savitzky-Golay filter. 



parameter 


Kepler data 


CoRoT data 


"bin 


5 


20 


** scale 


11 


11 


"Leg 


5 


5 


"rem 


1 


1 


"times 


3 


3 



The parameters of the filtering method used were optimized 
for CoRoT data and may not be ideal for the analysis of 
Kepler data. 

However, those parameters were optimized to preserve the 
shape of transits of short-period planets, such as the ones we 
expect to detect in the KepJerQl data set. They are not optimal 
for the search of planets with longer periods because they distort 
the shape of long transit features. For example, in the case of 
CoRoT-9b (shown in Fig.fTOTi we needed to use 40 points per bin 
to preserve the shape of the transit. 
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CoRoT-9b observed by CoRoT - Savitzky-Golay filter 
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Fig. 10. Model of the stellar activity of the light curve of CoRoT- 
9b with two different values for the configuration of the stellar 
activity filter. In the top panel, the configuration of the filter is 
not optimal and the filter tries to remove the transit signal. In the 
lower panel, the configuration was adapted to preserve the signal 
of these long transits. 



Fig. 11. Detail of the light curve of KIC 3340139 observed in the 
Q0 and Ql Kepler quarters. The continuous line represents the 
model of stellar activity together with the transit solution found 
by DST. Panels b-d show residuals of stellar activity that mimic 
a periodic transit feature. Panel a shows the expected position of 
the transit in Q0 data. 



Note that the nominal sampling rate of CoRoT is 512s in- 
stead of the nominal sampling rate of 30 minutes of Kepler, 
therefore «bin = 20 in the CoRoT sampling rate corresponds 
approximately to «bin = 5 in the Kepler sampling rate. 

On the other hand, Kepler reaches a higher S/N ratio level 
than CoRoT and many stellar activity patterns, which are too 
small to be important for CoRoT , become critical for Kepler. 
This is shown in Fig. [TTJ The Ql light curve of KIC 3340139 
shows three events approximately separated by 9.86 days (la- 
beled with the letters b, c, and d in the figure). If they were pro- 
duced by a transiting planet around a solar-like star, the size of 
the planet could be as small as 1.5 Earth radii. However, those 
events are not convincing enough: event b is in fact triggered 
by only three points ~ 2<x below the mean. Event c is probably 
caused by a residual of the stellar activity on a shorter timescale 
than the spot-induced variability. Event d is more similar to a 
planetary transit event, but the ingress and egress are not well 
defined and indeed there are similar features in the light curve 
which do not follow any particular periodic pattern. Event a cor- 
responds to Q0 data and clearly shows no transit-like feature at 
the expected position of the transit. We believe that the periodi- 
cal signal detected by the algorithm in KIC 3340139 is an artifact 
caused by residuals of the stellar activity that wer not properly 
handled by the filter because of their short typical timescales, 
comparable to the duration of a planetary transit. An important 
indication against the planetary origin of the events is that if 
the configuration of the stellar activity filter is changed, for ex- 
ample making «bin = 8, no periodical signal is detected in the 
light curve any more. Any detection that depends so strongly 
on the filtering method is not reliable. From another standpoint, 
the activity level displayed by KIC 3340139 is not unusual. The 
light curve of the LRaOl run from CoRoT 102584409 (a star 
with a similar magnitude to KIC 3340139, r' = 11.44 mag and 
K p - 1 1 .46 mag respectively) shows a similar activity pattern (see 
Fig.fTSI). but CoRoT is not sensitive to the small amplitude, short 
timescale patterns such as event c in Fig.QT] Figure[T3lcompares 
the noise level of these particular C oRoT and Kep ler targets 
following the procedure described by iPont et ali (120061) . which 
separates the contributions of the non-correlated (also known as 
white) noise, which evolves as 1 / ijn, being n the number of 



Comparison of the light curve of KIC 3340139 and CoRoT 102584409 
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Fig. 12. Comparison of the light curves of KIC 3340139 and 
CoRoT 102584409, which show similar activity levels. The po- 
sition of event c from Fig. QT|is marked with an arrow in the 
lower panel. 



binned flux values, and the correlated (also known as red) noise. 
The filter applied here was optimized for CoRoT data and is in- 
sensitive to the patterns displayed by KIC 3340139. We consider 
optimizing the algorithm filtering the stellar activity for small 
Kepler transits in future work. 

This limitation of the stellar activity filtering created a con- 
siderable number of false alarms in this data set compared to 
what is usual when analyzing CoRoT data (more than 100), 
which were removed by running the filtering method again with 
slightly different parameters and by comparing the results of the 
transit detection analysis in the new filtered data. Any detection 
that depended on the filtering method was rejected as a false 
alarm. As these are dependent on the stellar filtering method ap- 
plied and not on the detection method, they are not listed in this 
paper. 
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noise comparison between KIC 3340139 and CoRoT 102584409 
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Fig. 13. Analysis of the noise level of the light curves of 
KIC 3340139 and CoRoT 102584409. The typical noise level 
within lh in the CoRoT light curve is equivalent to the depth of 
a transit of a 1 .6Re planet around a solar-like star. The typical 
noise level for the Kepler light curve in lh is equivalent to the 
depth of a transit of a 0.87/?£ planet. The dotted line shows the 
1 / Vx expected white noise trend. 

7. Results 

As discussed before, one of the main conclusion of the stud- 
ies carried out by the CoRoT community when testing different 
transit detection algorithms was that both detections and f alse 
alarms were method dependent (Mou tou et al.ll20Q5l 120071) . In 
our analysis of the Ql data set we found that both statements are 
still true when comparing our results to those from the Kepler 
community. 

7.1. The yield of detections 

We found 43 planetar y cand idates analyzing 33 days of data 
that went unnoticed to IB1 lbl among them 8 mult iple transiting 
system s. Most of them were discovered later by Batalha et alJ 
d2012l) . who analyzed 480 days of data, but not all of them. There 
are still 15 planetary candidates, among them 3 multiple transit- 
ing systems, not discovered and published by the Kepler tearrQ. 
The advantage of having an algorithm that can detect new candi- 
dates in a significantly reduced data set is two-fold. On the one 
hand, the fewer transits needed to find a planet, the earlier can 
the follow-up begin, which is a critical point in the design of 
transit surveys because the resources are limited. On the other 
hand, the most interesting planets, such as terrestrial planets in 
the habitable zone of solar-like stars, have long periods, typically 
comparable to years, and will typically have few transit events. 
If we compare the distribution of our new candidates with those 
from Bl lbl we can see that the bulk of our detections has peri- 
ods between five and ten days (see Fig. RBI, which corresponds 
to candidates having six to three transit events in our data set. 

Table [3] contains the ep hemeris of the new planetary can- 
didates not foun d b vlBllbl nor found as eclipsing binaries by 
iPrsa et all (1201 ll) orlSlawson et all (120111) . Some of them were 
later found bv lBatalha et al.l ( 20121) : when this is the case, the 
KOI number assigned by the Kepler team is shown to identify 
them. The se candidates were ranked according to t he CoRoT 
rules (see ICarpano et al.l 120091: ICabrera etal] 120091) . Note the 



Fig. 14. Comparison of the depth and the period of the planetary 
candidates found in this work compared to those of lBl lbl 



pre sence of several planetary candidates similar to Co RoT- 
7b dLeger. Rouan. Schneider et al. 2009tlOueloz et al.l2009l) and 
Kepler- 1 0b dBatalha et al.l 120 1 lh in period and depth (they are 
marked with the label C7b-like). To detect the candidates we 
only used data from Ql, but to characterize the candidates and 
calculate the ephemeris we used public Kepler data from other 
runs to improve the ephemeris and confirm the detection. 

Table [4] contains the ephemeris of our false positives: de- 
tections ranked by the CoRoT rules that turned out to be back- 
ground eclipsing b inaries identif ied by the study of the cen- 
troid motion (Batal ha et al.ll2.Q10f) . The responsible contaminat- 
ing eclipsing binary or its position within the mask is indicated 
in the table. 

Finally, table [5] contains the ephemeris of the false alarms 
found in our analysis of the Ql data, which are discussed below 
in subsectionl7.3l 



7.2. Kepler candidates not detected by our pipeline 



We studiied the candidates found in lBl lbl that were not found by 
our analysis and discuss in the following sections possible rea- 
sons for these non-de tection s. Both data sets are not immediately 
comparable, because IbI lbl corresponds to four times more data 
than the analyzed here, and therefore we do not expect to find 
all. Still, we can learn about the performance of our algorithm 
and about the Kepler data set from these non-detections, as we 
detail below. 



7.2.1 . Candidates below the detection threshold 



while our paper was in the revision process, IOfir&Dreizie3 ( l20T2h 
published 84 new transiting signals in the Kepler data. 



In several cases, we found IB 1 lbl Kepler candidates below our 
detection threshold. The threshold was optimized according to 
the CoRoT experience and was fixed at a S/N ratio of 8 between 
the peak and the background. Several candidates, which could 
be clearly detected in the light-curves, produced periodograms 
with peaks immediately below the fixed threshold (see, for ex- 
ample, the case of the candidate KOI 532.01 from IBI lbl shown 
in Fig. [15]). In the absence of red noise, this detection threshold 
can be fixed for a particular detection algorithm by making the 
probabilit y of detecting a fal se alarm as low as needed (see, for 
example. iJenkins et al.ll20 02). However, in practice the presence 
of correlated residuals in the light curves distorts the expected 
distribution of false alarms and it is advisable to use a more con- 
servative value of the threshold. An illustrating case is shown in 
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riodogram of KIC 10454313 - KOI 532.01 - Q1 data 




15 20 
period (days) 



Fig. 15. Periodogram of KIC 10454313 showing the position 
of the peak corresponding to the planet candidate KOI 532.01, 
marked with an arrow. The height of the peak is 7.9, directly 
below the detection threshold (see discussion in text). 



periodogram of KIC 91 19458 - KOI 525.01 




detection threshold 



detection threshold 
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Fig. 16. Periodogram of KIC 9119458. The upper part corre- 
sponds to the analysis of the data set Ql and the peak of the 
candidate 525.01 is marked with an arrow. Its height is 5.4, well 
below the detection threshold. The lower part corresponds to the 
analysis of the data set Q2, where the height of the peak is 12.8 
(see discussion in text). 



Fig. [16] where the Ql data set alone, which includes three transit 
events, barely suffices to distinguish between the genuine peak 
from the candidate KOI 525.01 from lBl la For comparison, in 
the lower part of Fig. [16] the analysis of the data set Q2, which 
contains seven transit events, is shown. In this data set the peak 
clearly rises above the fixed threshold. Considering the level of 
the correlated residuals in the Ql data set, the threshold applied 
in the analysis presented in this paper seems to be too conserva- 
tive. 

7.2.2. Instrumental effects 

Among the candidates not found in our analysis there is a very 
particular case that is worth mentioning. The planet candidate 
from iBllbl KOI 559.01 (KIC 6422367) was not found in the 
Ql data set. We looked for the candidate in the Q2 data set and 
we were able to find it, but with a significantly different depth, 
700 ± 70 ppm, than the one reported in the original paper (214 



ppm). We pursued our analysis on this particular candidate by 
analyzing the latest data sets released by the Kepier team (see 
Fig. fT7T > and realized that the candidate was only present in the 
Q2 and Q6 data sets, not in Ql, Q3 and Q4 (there is no Q5 data 
for this target because of the failure of module 3). We note that 
from Q2 to Q6 the satellite performs a full revolution maneu- 
ver. This behavior is typical of a contamination from a back- 
ground eclipsing binary, which only pollutes the PSF of the can- 
didate in particular orientations of the CCD. We searched for 
an eclipsing binary that could be the origin of this contamina- 
tion in the field of view of Kepler and found that KIC 6422367 
has the same period and roughly the same phase as the candi- 
date. The periods of the candidate and the binary are 4.331 39 (5) 
days and 4.331 398(1) days respectively; the epochs are (HJD- 
2451 545.0) 121.722(4) and 121.73221 (7). The main counter- 
argument of this explanation is that this eclipsing binary is 1.8° 
away from the candidate (for example, during Q2, the eclips- 
ing binary was observed in module 1 1 in channel 34, whereas 
the planetary candidate was observed in the same module, but in 
channel 36). The probability that two independent targets have 
the same period and phase is extremely low. On the other hand, 
this phenomenon could have an instrumental origin. In support 
of this hypothesis, we have found two more examples of this 
behavior. The targets KIC 2830919 and KIC 3325239 show 
the same ephemeris as the eclipsing binary KIC 3836439 (see 
Fig. [TSV The first target is at 1.4° of the binary and the sec- 
ond at 0.8°. KIC 3325239 clearly shows the alternates of ap- 
pearance in quarters Ql and Q5, while it disappears in quar- 
ters Q2, Q3, Q4 and Q6 (see Fig. |T9l >. In space-based surveys, 
the contamination of neighbor targets by a bright eclipsing bi- 
nary, such as the contamination of KIC 9641031 of the targets 
KIC 9640985, KIC 9641008, KIC 9641041, and the planet can- 
didate KOI 7 12.01 (KIC 9640976, which is t hen a fa lse alarm de- 
spite of being ranked as a planet c andidate inlBl lbl see Fig . [20b, 
is we ll known (see, for example, Deeg et al. 2009; Batalha et alj 
120101) . but to our knowledge the type of contamina tion presented 
here h as not been documented before. Recently, Batalh a~et al.l 
d2012l) discussed the possibility that scattered light from bright 
eclipsing binaries contaminates the masks of neighboring tar- 
gets. However, these authors referred to targets within 20" of the 
binary, whereas the contamination that we have observed origi- 
nates in targets more than 1 arc degree away. The origin of this 
contamination deserves a more detailed study which is beyond 
the scope of this paper. The reas on for this behavior might be 
an effect named video cross- talk (IVan Cleve & Caldwelli r2009: 
J. Jenkins, priv. comm.). 



7.2.3. False positives from the Kepler pipeline 

In addition to the cases of KOI 559.01 and KOI 712.01 described 
above, there are other clear f alse alarms unnoticed among the 
planetary candidates in lBllbl 



- KOI 774 shows a clear eccentric secondary eclipse on the 
3 ■ 10~ 4 level (300 ppm), incompatible with the occultation 
of a p lanetary companion (see Coug hlin & Lopez-Morales! 
120121) . This candidate has be en classified as an eclipsing bi- 
nary bv lSlawson et alJ d201lh . 

- KOI 823 shows out-of-transit variations compatible with a 
massive companion and a secondary eclipse on the 7 ■ 10~ 4 
level, incompatible with the occultation of a planetary com- 
panion. Th is candidate was also clas sified as a potential false 
positive by[D emorv &Seager!(l2011h . 
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folded and binned light curve of KIC 6422367 (KOI 559) 



folded light curve of KIC 3325239 
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Fig. 17. Comparison of the light curves of the target 
KIC 6422367 folded at the period and phase of the planetary 
candidate KOI 559.01. The signal of the candidate is not visible 
in quarters Ql, Q3, and Q4 (not shown). See discussion in text. 



folded light curves of KIC 3836439 (top) 
and KIC 2830919 (middle) and KIC 3325239 (bottom) 
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Fig. 18. Comparison of the light curves of the targets 
KIC 2830919 and KIC 3325239 folded at the ephemeris of the 
eclipsing binary KIC 3836439. See discussion in text. 

- KOI 876 shows a secondary eclipse on the 7 • 10~ 4 level, 
incompatible with the occultation of a planetary compan- 
ion. This candidate was a lso classified as a false positive by 
iDemorv & Seageri d201 lh . 

- KOI 960 shows an eccentric secondary eclipse on the 5 ■ 10 4 
level, incompatible with the occultation of a planetary com- 
panion. Th is candidate was also cla ssified as a potential false 
positive by |d emorv &Seageri(l2011h . 

- KOI 1285 shows a secondary eclipse on the 3 ■ 10 level, 
incompatible with the occultation of a planetary companion. 
Additionally, it shows strong eclipse timing variations, indi- 
cating the presence of either stellar interaction or additional 
companions in the system. T his candidate has be en classi- 
fied as an eclipsing binary by Slaw son et al.1 (1201 1 | ) and also 
classif ied as a potential false positive by IDemorv & Seagerl 
(1201 lh. 



KOI 1401 displays the typical out-of-eclipse behavior of 
an eclipsing binary and has been classified as such by 
ISlawson et~ai1(l201lh . 

KOI 1448 is an active star, the light curve shows a secondary 
eclipse on the 8 • 10~ 4 level, incompatible with the occulta- 
tion of a planetary companion, and has been classified as an 



Fig. 19. Comparison of the light curves of the target 
KIC 3325239 folded at the period and phase of the eclipsing 
binary KIC 3836439. From top to bottom the data of quarters 
Ql, Q2, and Q5 are shown. The signal is only visible in quarters 
Ql and Q5. See discussion in text. 



folded light curves of the planetary candidate KOI 71 2.01 (KIC 9640976) 
and the eclipsing binary KIC 9641 031 
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Fig. 20. Comparison of the light curves of the planetary can- 
didate KOI 712.01 (KIC 9640976) and the eclipsing binary 
KIC 9641031. The left side of the figure shows the data of 
KOI 712.01 from the different quarters. The transit events are 
not seen in Ql, and the transit depth is not constant in quarters 
Q2 to Q6. The top part of the right side shows the light curve of 
the eclipsing binary and the lower part the data from quarters Ql 
to Q6 of 712.01 folded at the ephemeris of the eclipsing binary. 
The secondary eclipse is visible in this data set (marked with an 
arrow in the figure). 



eclipsing binary bv lSlawson et al.l (1201 lh and also classified 
as a false positive by Demorv & Seager T(l201 lh . 
KOI 1452 is another active star with a secondary eclipse on 
the 1 ■ 10~ 3 level, incompatible with the occultation of a plan- 
etary c ompanion, and has be en classified as an eclipsing bi- 
nary by Sl awson et al.1 (1201 lh . 

KOI 1459 is an eclipsing binary wit h a real period of 1.3 8 
days, twice that found bv iBllbl and ISlawson et all d201lh . 
The depths of the primary and secondary eclipses are 0.45 ± 
0.01 and 0.36 ±0.01. 

KOI 1541 shows a secondary eclipse on the 1 • 10~ 3 level, 
incompatible with the occultation of a planetary compan- 
ion, and has been classified as an eclipsing binary by 
ISlawson et ai1(l201lh . 
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- KOI 1543 shows a secondary eclipse on the 1 ■ 10~ 3 level, 
incompatible with the occultation of a planetary companion. 
It was classified as an eclipsing binary with the wrong pe- 
riod of 7.92 days by IPrsa et all d201 ll) . the real period is 
ha lf that or 3.96 days, but it is not in the list of binaries 
bv lSlawson et al. ( 201 lb although it was classified as a false 
positive bv lDemorv & Seagerl (1201 lh . 

- KOI 1546 shows out-of-transit variations compa tible with 
an ecli psing binar y and was classified a s such by IPrsa et al.l 
(1201 ll) . but not bv lSlawson et all (1201 lh . 

For completion, we add that the candidates KOI 4, KOI 51, 
KOI 131, KOI 135, KOI 138, KOI 201, KOI 206, KOI 211, 
KOI 225, KOI 256, KOI 261, KOI 340, KOI 687, KOI 741, 
KOI 822, KOI 913, KOI 976, KOI 1187, KOI 1020, KOI 1032, 
KOI 1227 , and K OI 13 85 have also been cla ssified either by 
IPrsa et al.l (1201 lh or by ISlawson et all (1201 lh as eclipsing bi- 
naries, and considering their light curves, they probably are. 
They all still appear as valid planetary candidates in the list of 
Batal ha et all (120121) . because the vetting of the first 1 600 KOIs 
has not been reprocessed with the improved method described 
in that paper which has been used for the most recent release 
of KOIo They represent a small fraction of the total number 
of candidates, but they are still readily identifiable by a care- 
ful analysis of the light curve. The careful tests applied by the 
Kepler pipeline could not remove these easily identificable false 
positives. One may wonder how many of the remaining candi- 
dates with lower S/N ratios, in which region those tests are less 
sensitive, will turn out to be also false positives. Authors should 
be extremely careful when interpreting the statistical analysis 
based on candidates without radial velocity c onfirmation or can - 
didates that have not been properly validated dTorres et al.l2 01 1 ). 

7.2.4. Other reasons 

In 14 cases either the stellar variability filter failed to remove 
the stellar signal without damaging the transit signal (for exam- 
ple, in the case of KOI 242, with a transit duration of almost 
6h, see Fig. |2T1 > or some residual outliers prevented the candi- 
date detection. As we mentioned above, we relied on the cosmic 
ray filtering of the Kepler pipeline. In some cases, remaining 
outliers in the light curve prevented the detection of candidates 
such as KOI 234 (see Fig.l22l with our algorithm, but these cases 
represent a minority of the non-detections. The stellar variabil- 
ity filtering is performing well, allowing the detection of most of 
the expected Kepler candidates, although it could be improved to 
enhance the detection of weaker signals of yet undetected transit 
candidates. 



7.3. False alarms from our pipeline 

The false alarms in Table [5] were identified because they could 
be detected in the Ql data set, but not in subsequent runs. We 
present as an illustrating example the case of KIC 1 1350341 (see 
Fig. [23) . The periodogram of the Ql data set has a distinct peak 
at the period of 9.3 days caused by thre transit events visible 
in the raw light curve. However, subsequent observations during 
the quarters Q2 to Q6 do not show the presence of this periodic 
signal. We do not find any nearby contaminating eclipsing bi- 
nary that could be responsible of the signal. However, there are 



KOI 242: transit detail; tambin=5 



KOI 242: periodogram (Q1 data); 1ambin=5 



while our paper was in the revision process, IOfir&Dreizieild2012h 
provided arguments to reclassify some additional KOIs as eclipsing bi- 
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Fig. 21. Light curve of KOI 242 filtered with the original con- 
figuration used for the analysis in this paper (top) and with an 
alternative configuration (bottom) that prevents the filtering of 
long transits. 
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Fig. 22. Above: periodogram of KOI 234 (KIC 8491277) in the 
original Ql data set. Below, the same periodogram after applying 
a 4<r clipping to the light curve. The candidate, which could not 
be distinguished from the noise in the original data set, is clearly 
detected in the filtered data set. 



some nearby variable stars that contaminate the mask. Possibly 
the stellar activity filter is not completely removing their contri- 
bution, and therefore the residuals produce the spurious detec- 
tion. We think that the Ql data suffers from more contamination 
from neighboring sources than subsequent runs, which could ex- 
plain why false alarms are only present in the Ql data. One of the 
reasons for this additional contamination would be the presence 
of a varia ble star among the guiding stars in Ql (see the dis- 
cussion in ICaldwell et aTll2010t lHaas et al.ll2010t Uenkins et al.l 
2010aj). Another possible reason are instrumental effects such as 
those described above in the case of KOI 559.01. 



8. Summary 

We have described a new algorithm for transit detection using 
an analytic model for the shape of a transit (Section[2]). We have 
compared in Section [3] the performance of the new transit de- 
tection technique, DST, with that of a widely used technique, 
the BLS algorithm, and shown that our algorithm performs bet- 
ter. We analyzed two representative test cases and we concluded 
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Fig. 23. Left: periodograms of the Ql (top) and Q2 (bot- 
tom) data sets of the target KIC 11350341. Only the Ql data 
show a periodic signal. Right: filtered and folded light curve 
of KIC 11350341. Ql data distinctly show a transiting signal, 
which cannot be recovered in Q2 data. 



that DST produces a better signal detection efficiency than BLS 
due to the improved description of the transit signal and to the 
improved definition of the test statistic in all relevant cases for 
transit surveys: terrestrial planets, giant planets, and eclipsing 
binaries. We discussed the advantages and the flexibility of this 
algorithm in defining the region in transit (Section|4]l, with their 
implications for the search for planets that experience significant 
transit timing variations (Subsection 14.3b and transiting multi- 
ple planet systems (Section|5]l. We intend to develop methods to 
make the computational effort required by this algorithm more 
effective in the future by changing the paradigm of the blind tran- 
sit searches, as it has been described in Section|5] 

We applied our algorithm and a set of tools to filter the stellar 
activity, which was designed for the CoRoT search of transiting 
planets, to the publicly released Ql Kepler data. As a result of 
this analysis, some adjustments were made to improve the per- 
formance of the stellar activity filtering. We report 15 planetary 
candidates, not reported previously in the literature, based on 
our analysis of 33 days of Kepier data. We discussed the im- 
pact of instrumental and stellar activity residuals in the data on 
the detection of planetary candidates. Our study shows that the 
analysis of space mission data advocates the use of complemen- 
tary detrending and transit dete ction tools for future space-based 
transit surveys such as PLATO (Catala 2009). 
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Table 3. Parameters from the planetary candidates. 



KIC 


KOI 


rank 


SNR 


Kp 


period 


epoch 


duration 


depth 


R, 


^eff 




comments 










mag 


[days] 


[HJD-2454 900] 


[h] 


[ppm] 


[R-] 


[K] 


[cgs] 




3128552.01 


2055.01 


3 


10 


14.523 


8.6822±0.0011 


109.4164±0.0060 


3.3±0.2 


500+60 


0.76 


5 530 


4.67 




4144576.01 


2202.01 


3 


12 


14.113 


0.813146±0.000002 


64.8261±0.0008 


1.5+0.1 


160+10 


0.74 


5123 


4.66 


CoRoT 7b-like 


4263293.01 


1895.01 


2 


8 


15.862 


8.45936±0.00074 


109.1483±0.0040 


2.5±0.2 


1300+170 


0.56 


4288 


4.73 




4263529.01 




2 


9 


14.031 


10.10243±0.00094 


106.3196±0.0058 


1.5+0.5 


1000+100 


0.89 


5 330 


4.53 




4278221.01 


1615.01 


2 


12 


11.524 


1.340907±0.000042 


105.4496±0.0013 


1.5+0.1 


120+10 


0.88 


5 534 


4.56 


CoRoT 7b-like 


4644952.01 


1805.01 


2 


8 


13.828 


6.94130±0.00052 


112.1919±0.0039 


1.6+0.1 


890+60 


1.62 


5 647 


4.08 




4917596.01 


1973.01 


2 


10 


15.753 


3.29063±0.00023 


105.6227±0.0020 


1.3+0.1 


780+120 


0.79 


4360 


5.49 




5080636.01 


1843.01 


2 


12 


14.404 


4.19411±0.00014 


109.0747±0.0014 


1.3+0.1 


820+70 


0.63 


3 673 


4.51 




5299459.02 


1576.01 


1 


13 


14.072 


13.0843±0.0010 


65.6296±0.0023 


3.1+0.1 


600+20 


0.91 


5214 


4.51 




5369827.01 


_ 


2 


11 


13.461 


3.60718±0.00013 


107.4454±0.0016 


1.2+0.1 


350+30 


1.16 


5 367 


4.33 




5371776.02 


1557.03 


1 


25 


14.840 


5.31590±0.00016 


65.4309±0.0008 


1.3+0.1 


850+40 


0.98 


5966 


4.49 




5371776.03 


1557.02 


1 


23 


14.840 


9.65335±0.00008 


67.8572±0.0019 


2.9+0.1 


1200+30 


0.98 


5966 


4.49 




5387843.01 


1762.01 


2 


12 


14.996 


0.870706±0.000002 


65.0811±0.0008 


1.2+0.1 


390+10 


0.93 


5 726 


4.52 




5443775.01 




2 


15 


12.936 


3.30751±0.00016 


107.5662±0.0026 


1.9+0.1 


260+20 


1.10 


5 782 


4.39 




5794570.01 




1 


14 


12.430 


5.44832±0.00025 


109.0765±0.0021 


2.6+0.1 


530+20 


0.56 


5 479 


4.92 


active star, P mt ~ 6d; peak to peak 3%. 


6153672.01 




3 


17 


15.337 


3.36477±0.00010 


108.401 1±0.0010 


1.3+0.1 


3300+200 


0.85 


5 600 


4.59 


pulsating bg star with period 0.31d 


6616218.01 


1692.01 


1 




12.557 


2.461 12±0.00030 


66.3108±0.0040 


1.6+0.1 


66+10 


0.96 


5224 


4.47 




6616218.02 


1692.02 


1 


15 


12.557 


5.96039±0.00002 


70.2641±0.0010 


2.9+0.1 


720+10 


0.96 


5 224 


4.47 




6696462.01 


1693.01 


3 


9 


14.980 


3.40377±0.00010 


107.6720±0.0032 


1.2+0.1 


530+70 


0.89 


5 384 


4.54 




7202957.01 




2 


9 


10.158 


1.71674±0.00007 


106.2430±0.0018 


2.0+0.1 


80+10 


2.02 


5618 


3.91 


active star 


7202957.02 




2 


9 


10.158 


8.16912±0.00058 


110.0257±0.0027 


3.4+0.2 


120+10 


2.02 


5618 


3.91 


id. 


8741367.01 




2 


17 


12.551 


0.98907±0.00010 


54.1067±0.0032 


1.9+0.1 


130+10 








CoRoT 7b-like 


9044228.01 


1988.01 


1 


14 


14.145 


0.934835±0.000002 


65.5200±0.0007 


1.3+0.1 


240+10 


1.66 


4604 


4.03 


CoRoT 7b-like 


9475552.01 




2 


16 


14.975 


0.843349±0.000002 


64.9733±0.0010 


1.4+0.1 


300+10 


1.11 


4659 


4.31 


CoRoT 7b-like 


9729691.01 


1751.01 


1 


14 


14.548 


8.6903 1±0.00094 


110.1064±0.0045 


2.4+0.1 


1200+100 


1.11 


5052 


4.34 


active star, P mt ~ 23d; peak to peak 1% 


9729691.02 


1751.02 


1 


18 


14.548 


20.9926±0.0013 


107.1019±0.0033 


2.7+0.2 


1100+100 


1.11 


5 052 


4.34 


id. 


9780149.01 




3 


14 


12.294 


2.51729±0.00039 


55.1147±0.0026 


1.2+0.1 


200+20 










9872283.01 


1815.01 


3 


12 


13.884 


3.12716±0.00011 


107.3675±0.0014 


1.6+0.1 


630+40 


1.80 


4680 


3.98 




9909735.01 


1779.01 


1 


17 


13.297 


4.66285±0.00011 


109.0435±0.0010 


3.0+0.1 


1580+30 


1.48 


5666 


4.15 


active star, P mt ~ 7. Id; peak to peak 2% 


9909735.02 


1779.02 


1 


13 


13.297 


11.81446±0.00079 


125.1682±0.0023 


2.7+0.1 


850+50 


1.48 


5666 


4.15 


id. 


9965957.01 


1911.01 


2 


9 


15.196 


5.75298±0.00054 


110.0284±0.0054 


1.7+0.2 


780+90 


0.87 


5 276 


4.55 




10190777.01 


1937.01 


3 


17 


13.590 


1.411471±0.000035 


106.5605±0.0011 


1.5+0.1 


380+20 


0.70 


4242 


4.55 




10332883.01 


1880.01 


3 


17 


14.440 


1.151123±0.000040 


105.3691±0.0016 


2.0+0.1 


470+30 


0.69 


3 898 


4.49 




10464050.01 


1851.01 


3 


17 


14.769 


4.46996±0.00022 


109.5373±0.0017 


1.4+0.1 


590+60 


0.82 


5712 


4.63 




11187436.01 




3 


13 


15.625 


5.90735±0.00005 


66.9356±0.0003 


1.2+0.1 


5500+100 


0.78 


4813 


4.58 




11298298.01 


1802.01 


2 


15 


13.345 


5.24518±0.00012 


110.5189+0.0011 


1.8+0.1 


620+30 


0.97 


5 868 


4.50 




11551692.01 


1781.02 


1 


24 


12.231 


3.005 148±0.000004 


55.0771±0.0004 


2.1+0.1 


630+10 


1.99 


4720 


3.91 




11551692.02 


1781.01 


1 


27 


12.231 


7.834483±0.000008 


60.3851 ±0.0003 


2.9+0.1 


1900+20 


1.99 


4720 


3.91 




11551692.03 




1 


8 


12.231 


58.017141±0.000030 


84.8529±0.0012 


4.6+0.1 


1250+20 


1.99 


4720 


3.91 





Table 3. continued. 



KIC 


KOI 


rank 


SNR 


Kp 


period 


epoch 


duration 


depth 


R s 




log g comments 










mag 


[days] 


[HJD-2454 900] 


M 


[ppm] 


[R-] 


[K] 


[cgs] 


11853878.01 


1833.01 


1 


30 


14.265 


3.69302±0.00001 


65.7330±0.0009 


1.6+0.1 


700+20 


0.60 


4 300 


4.67 


11853878.02 




1 


11 


14.265 


5.70959±0.00002 


70.1174±0.0013 


1.4+0.1 


550+30 


0.60 


4300 


4.67 


11853878.03 




1 


26 


14.265 


7.68426±0.00002 


66.3617±0.0008 


1.3+0.1 


970+40 


0.60 


4300 


4.67 


11911561.01 




3 


10 


13.579 


8.870498±0.000025 


70.4779±0.0008 


1.3+0.1 


630+20 
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Table 4. Parameters from the false positives. 





rank 




Kn 


period 


epoch 


duration 


depth 


coiiniieiiLS 








mag 


[days] 


rum TA^zLQOOl 
[ I 1J1J-Z <4Z>4 7UUJ 




[ppm] 




Zojjozo 



Z 


1 S 
lo 


1 1 SQQ 

1 1.077 


1 077/170-1-0 OOOO^O 
l.U / /4 /U+U.UUUU3Z 


1 O^ ^07zl-i-O OOl ^ 
1UJ. JZ /4±U.UU1 J 


o 
z.u 


zu / 


pixel (z,oj 


Lyy 1 1 lo 


D 


1 


1 GA 1 
IZ.oul 


c q^07Q_i_0 nnnQ^ 


i no 70 i n.n OOQ 1 

iuv. /Z1U+U.UU31 


^ 

J.Z 


/03 


pixel (z,j ) 


A ^70^^ 

4o /Ujjj 


o 

z 


1 3 


1 1.D4U 


/I 7/fQ/l-i.O OOl 1 
4. /4V4+U.UU1 1 


A7 A^QQ-uO OOQO 


Z.3 


j Id 


pixel (3,6) 


a aaaaaq 
4000UUo 


1 


n 
y 


1/1 A 1A 

14.434 


Z.Z4/yj±U.UUUZy 


i ha o 1 AA_i_n nn^o 
IUo.ZIoo+U.UUjZ 


O A 

z.o 


1 /I 1 
141 


■ v ,i /A 0\ 

pixel (o,Z) 


AQ i ^ooo 

4 V 1 3UUU 


o 

z 


1 £ 
10 


I 1 AAS 

I I .OOo 


A AA000 + C\ 0001 s 
4.44ZZZ±U.UUU 1 o 


1 OA ^QOI -i-O OO07 

iuo. jyui+u.uuzz 


1 
Z. 1 


1AO 
34Z 


pixel (,i,4^ 


^QOOOOA 
jJUZUUO 




Q 

y 


1 ^ o^o 


A 1 /l/lO-uO OO^ 1 
0. 144Z+U.UUJ 1 


Al< 7AG_i_0 01 A 
DJ. /00 + U.U14 


D.J 


1 7 1 Q 

1 / lo 


pixel \L,L) 




a 




1 ^ ^QA 

1 J.JOD 


O AOA 1 0^-uO OOOOQ^l 
U.4Z41Z3+U.UUUUV3 


A^ 1 Q 1 'Jj.n OO/IO 
OJ.1M3+U.UU4U 


o 

z.u 


on a 

Z /4 


n\v aA CO A\ 

pixel (,z,4j 


5894073 


1 


11 


13.748 


10.5953±0.0057 


69.9781+0.0068 


5.1 


511 


pixel (6,3) 


9307105 


3 


12 


15.023 


13.54510+0.00052 


108.9023+0.0024 


4.4 


1679 


pixel (1,4) 


12691412 


2 


14 


13.502 


6.9030±0.0025 


70.0241+0.0043 


3.4 


965 


pixel (7,4) 


3836453 


2 


17 


12.057 


1.540567+0.000056 


106.5835+0.0015 


2.5 


137 


contaminated by KIC 3836439 


3836522 


3 


15 


14.844 


1.54048+0.00010 


106.5888+0.0034 


2.3 


573 


contaminated by KIC 3836439 


3836399 


3 


14 


14.449 


1.54019+0.00032 


64.9981+0.0043 


2.3 


619 


contaminated by KIC 3836439 


2830919 


1 


8 


14.753 


1.54119+0.00044 


65.0032+0.0049 


2.1 


445 


contaminated by KIC 3836439 


3325239 


2 


9 


15.308 


1.54114+0.00036 


64.9837+0.0048 


1.9 


687 


contaminated by KIC 3836439 
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Table 5. Parameters from the false alarms. 



KIC 


rank 


SNR 


Kp 


period 


epoch 


duration 


depth 








mas 


Tdavsl 


[HJD-2 454 900] 




fnnrnl 


3246908 


3 


8 


14.756 


3.8713±0.0017 


66.168+0.036 


3.4 


443 


3443220 


2 


8 


13.615 


7.0007+0.0048 


71.1870+0.0094 


3.9 


472 


3446775 


2 


9 


10.949 


4.9819±0.0036 


66.168+0.011 


4.3 


219 


3544602 


2 


10 


15.011 


4.421 1±0.0016 


65.8624+0.0040 


2.4 


519 


3859729 


3 


9 


13.852 


6 4673+0 0018 


65 4454+0 0044 

V ' . 1^ 1 » ' 1 _!_ V./ . \J\J \ \ 


1.9 


759 


3962913 


2 


10 


13.729 


1.8433+0.0040 


64.851+0.039 


2.2 


176 


3964162 


1 


8 


15.331 


4.1584+0.0015 


65.2196+0.0064 


2.0 


838 


4275305 


3 


8 


12.573 


0.84275+0.00037 


65.3835+0.0082 


1.8 


83 


4364467 


2 


10 


14.653 


11.8808+0.0084 


70.875+0.014 


1.8 


1101 


4366351 


3 


8 


12.873 


11.2557+0.0061 


66.0898+0.0074 


5.9 


945 


5015478 


2 


8 


12.630 


5.5103+0.0022 


65.8242+0.0075 


1.9 


129 


6043914 


3 


8 


13.811 


2.54285+0.00095 


65.5311+0.0079 


2.4 


288 


6305938 


2 


9 


13.933 


4.4399+0.0027 


65.829+0.011 


2.1 


228 


7946652 


2 


12 


14.176 


6.5196+0.0018 


69.8381+0.0056 


2.1 


671 


8241503 


2 


9 


13.332 


3.2996+0.0032 


67.728+0.020 


2.6 


163 


8625295 


2 


10 


13.163 


1.45826+0.00032 


65.7306+0.0036 


1.8 


203 


9827003 


3 


10 


12.256 


1.64217+0.00042 


66.2096+0.0033 


1.8 


283 


10743519 


2 


8 


13.864 


5.7054+0.0017 


66.1704+0.0051 


2.6 


385 


11047067 


3 


9 


13.574 


6.8768+0.0031 


67.8136+0.0076 


2.3 


420 


11350341 


3 


8 


11.461 


9.2973+0.0089 


73.659+0.013 


5.8 


413 
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